DBTSS: DataBase of Human Transcriptional Start Sites and Full-Length cDNA
Authors
Abstract
Although a large number of new genes have been predicted from the draft sequence of the human genome, cDNA analysis is still needed to locate genes correctly and to examine their regulatory regions. First, a correctly determined transcriptional start site (TSS) allows the regulatory region of a gene to be examined more precisely. It has been reported that one gene can have multiple TSSs, and this phenomenon can be explained either by differential gene expression or by a rather loose specification of TSSs [1]. In either case, it would be intriguing to find correlations between TSSs and their upstream sequences. Second, determining TSSs helps to predict precisely the full-length (or at least the maximum) coding sequence, which is essential for examining, for instance, the presence of N-terminal sorting signals. Positional data on TSSs and full-length cDNAs are therefore an indispensable source of biological information. To obtain full-length cDNAs, we have developed the 'oligo-capping' method [2, 3], which allows the 5' ends of full-length cDNAs to be sequenced. On this basis, we constructed the novel database presented in this study, DBTSS [4]. DBTSS will be useful not only as a full-length version of the RefSeq database [5], but also as a resource for analyzing regulatory information under a variety of expression conditions.
Similar resources
DBTSS: DataBase of human Transcriptional Start Sites and full-length cDNAs
Although the information of cDNAs is indispensable for analyzing gene function, most of the cDNA sequences stored in current databases are imperfect in the sense that they lack the precise information of 5' end termini. To overcome this difficulty, we have developed the oligo-capping method to obtain full-length cDNAs, the information of which has been partly deposited in public databases. In t...
DBTSS: database of transcription start sites, progress report 2008
DBTSS is a database of transcriptional start sites, based on our unique collection of precise, experimentally determined 5'-end sequences of full-length cDNAs. Since its first release in 2002, several major updates have been made. In this update, we expanded the human transcriptional start site dataset by 19 million uniquely mapped, and RefSeq-associated, 5'-end sequences, which were generated ...
DBTSS: DataBase of Transcriptional Start Sites progress report in 2012
To support transcriptional regulation studies, we have constructed DBTSS (DataBase of Transcriptional Start Sites), which contains exact positions of transcriptional start sites (TSSs), determined with our own technique named TSS-seq, in the genomes of various species. In its latest version, DBTSS covers the data of the majority of human adult and embryonic tissues: it now contains 418 million ...
Complex Network Approach to Human Promoter Sequences
Based upon the correlation matrix of the human promoter sequences, a complex network is constructed to capture the principal relationships between these promoters. This complex network has both a right-skewed degree distribution and clustering, i.e., a hierarchical structure. An eigenvector centrality (EC) based method is used to reconstruct this hierarchica...
Collection and Analysis of Eukaryotic Promoter Regions: DBTSS (DataBase of Transcriptional Start Sites)
The recent determination of the human and mouse draft genome sequences is a landmark for the post-sequencing era. One of the major challenges in this era is the interpretation of the promoter regions. To this end, precise identification of the transcriptional start sites (TSSs) is essential. However, such information cannot be obtained from usual cDNA or EST data. Although Eukaryotic Promo...
Publication date: 2001